Architecture 4

CoAtNet: Marrying Convolution and Attention for All Data Sizes

Updated: August 24, 2021

5 minute read

Not All Images are Worth 16x16 Words: Dynamic Vision Transformers with Adaptive Sequence Length

Updated: July 8, 2021

5 minute read

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Updated: July 2, 2021

4 minute read

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Updated: June 29, 2021

5 minute read

Back to top ↑

Unsupervised Learning 1

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning(2020)

Updated: July 29, 2020

4 minute read

Back to top ↑

Theory 5

CoAtNet: Marrying Convolution and Attention for All Data Sizes

Updated: August 24, 2021

5 minute read

Early Convolutions Help Transformers See Better

Updated: August 13, 2021

3 minute read

Sharpness-Aware Minimization for efficiently improving generalization

Updated: August 10, 2021

6 minute read

When Vision Transformers Outperform ResNets without Pretraining or Strong Data Augmentations

Updated: July 30, 2021

5 minute read

Deep Double Descent: Where Bigger Models and More Data Hurt

Updated: June 8, 2021

5 minute read

Back to top ↑

Semi-supervised 4

Unsupervised Data Augmentation for Consistency Training(2019)

Updated: January 14, 2021

3 minute read

Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning(2017)

Updated: January 13, 2021

3 minute read

Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results(2017)

Updated: January 11, 2021

1 minute read

Temporal Ensembling For Semi-Supervised Learning(2016)

Updated: January 9, 2021

2 minute read

Back to top ↑

Visualizing 1

Visualizing the Loss Landscape of Neural Nets

Updated: August 1, 2021

9 minute read

Back to top ↑

Self Training 1

Rethinking Pre-training and Self-training(2020)

Updated: July 13, 2020

3 minute read

Self-training with Noisy Student improves ImageNet classification(2019)

Updated: July 8, 2020

4 minute read

Back to top ↑

Augmentation 1

RandAugment: Practical automated data augmentation with a reduced search space

Updated: June 24, 2021

5 minute read

AutoAugment: Learning Augmentation Strategies from Data

Updated: June 23, 2021

5 minute read

Back to top ↑